Amendments to the Claims: 



This listing of claims will replace all prior versions, and listings, of claims in the application: 
Listing of Claims: 

1. (Previously presented) A processor-implemented method of clustering a set of records, 
each of the records having attribute values for a set of attributes, the method comprising: 

for each attribute of the set of attributes, determining a characteristic value for said each 
attribute, based on attribute values of said each attribute; 

wherein determining the characteristic value comprises calculating the attribute values of 
said attribute across the records; 

for each attribute value, determining a deviation from the characteristic value of said each 
attribute; 

for each record, sorting the set of attributes based on deviations of the attribute values, to 
provide a key; and 

clustering the set of records based on the key into a clustering result that includes a 
plurality of clusters; 

wherein the key comprises an ordered list of the set of attributes and the deviations; and 
refining the clustering result by selectively changing a length of the key to change the 
number of the clusters. 

2. (Original) The method of claim 1, further comprising calculating a mean value of the 
attribute values of said each attribute as the characteristic value. 

3. (Original) The method of claim 1, wherein a median value of the attribute values of said 
each attribute is determined as the characteristic value. 

4. (Original) The method of claim 1, wherein determining the deviation comprises 
calculating a difference between said each attribute value and the characteristic value of said 
each attribute. 
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5. (Original) The method of claim 1, wherein determining the deviation comprises 
calculating a difference between said each attribute value and the characteristic value of the 
corresponding attribute, and dividing the difference by the characteristic value of said each 
attribute. 

6. (Original) The method of claim 1, wherein sorting the set of attributes comprises using 
absolute values of the deviations of the attribute values as a sorting criterion. 

7. (Original) The method of claim 1, wherein a first record of the set of records contains a 
first key and a second record of the set of records contains a second key; and 

further comprising placing the first key and the second key into a single cluster if the first 
key and the second key have identical subsequences of a first length. 

8. (Original) The method of claim 1 , wherein a first record of the set of records contains a 
first key and a second record of the set of records contains a second key; and 

further comprising placing the first key and the second key into a single cluster if the first 
key and the second key have identical subsequences of absolute values of the deviations. 

9. (Original) The method of claim 1, wherein a first record of the set of records contains a 
first key that has a first sub-sequence, and a second record has a second sub-sequence contains a 
second key; and 

further comprising placing the first key and the second key into a single cluster if the first 
and second sub-sequences comprise the same set of attributes. 

10. (Original) The method of claim 9, wherein the first and second subsequences comprise 
the same set of attributes irrespective of a sign of the deviations of the attribute values. 

11. (Previously Presented) The method of claim 10, wherein refining the clustering result 
comprises: 

identifying a cluster having a smallest number of records; 

for each record of the identified cluster searching another cluster having records with best 
matching keys; and 

distributing the cluster with the smallest number of records to the other cluster having 
records with best matching keys, to reduce the total number of clusters. 
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12. (Original) The method of claim 1 1 , further comprising reducing a length of the first sub- 
sequence and a length of the second sub-sequence in order to find a best match. 

13. (Original) The method of claim 12, further comprising using a distance measure to find 
another cluster for a record of the identified cluster. 

14. (Previously presented) The method of claim 13, wherein the distance measure comprises 
a Euclidean distance. 

15. (Previously Presented) A computer program product having executable instruction codes 
that are stored on a computer-readable medium, for clustering a set of records, each of the 
records having attribute values for a set of attributes, the computer program product comprising: 

a set of instruction codes, which, for each attribute of the set of attributes, determines a 
characteristic value for said each attribute, based on attribute values of said each attribute; 

wherein the characteristic value is calculated from the attribute values of said attribute 
across the records; 

a set of instruction codes, which, for each attribute value, determines a deviation from the 
characteristic value of said each attribute; 

a set on instruction codes, which, for each record, sorts the set of attributes based on 
deviations of the attribute values, to provide a key; and 

a set of instruction codes for clustering the set of records based on the key into a 
clustering result that includes a plurality of clusters; 

wherein the key comprises an ordered list of the set of attributes and the deviations; and 

a set of instruction codes for refining the clustering result by selectively changing a 
length of the key to change the number of the clusters. 

16. (Previously Presented) The computer program product of claim 15, further comprising a 
set of instruction codes for calculating a mean value of the attribute values of said each attribute 
as the characteristic value. 

17. (Previously Presented) The computer program product of claim 15, further comprising a 
set of instruction codes for setting a median value of the attribute values of said each attribute as 
the characteristic value. 
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18. (Previously Presented) The computer program product of claim 15, wherein the set of 
instruction codes for determining the deviation from the characteristic value determines the 
deviation by calculating a difference between said each attribute value and the characteristic 
value of said each attribute. 

19. (Previously Presented) The computer program product of claim 15, wherein the set of 
instruction codes for determining the deviation from the characteristic value determines the 
deviation by calculating a difference between said each attribute value and the characteristic 
value of the corresponding attribute, and by dividing the difference by the characteristic value of 
said each attribute. 

20. (Previously Presented) The computer program product of claim 15, wherein the set of 
instruction codes for sorting the set of attributes sorts the set of attributes using absolute values 
of the deviations of the attribute values as a sorting criterion. 

21. (Previously presented) A processor-implemented system for clustering a set of records, 
each of the records having attribute values for a set of attributes, the system comprising: 

each attribute of the set of attributes comprising a characteristic value for said each 
attribute based on attribute values of said each attribute; 

wherein the characteristic value is calculated from the attribute values of said attribute 
across the records; 

each attribute value comprising a deviation from the characteristic value of said each 
attribute; 

each record comprising the set of attributes based on deviations of the attribute values, to 
provide a key; and 

wherein the set of records are clustered based on the key into a clustering result into a 

clustering result that includes a plurality of clusters; 

wherein the key comprises an ordered list of the set of attributes and the deviations; and 
means for refining the clustering result by selectively changing a length of the key to 

change the number of the clusters. 

22. (Original) The system of claim 21, wherein a mean value of the attribute values of said 
each attribute is calculated as the characteristic value. 
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23. (Original) The system of claim 21, wherein a median value of the attribute values of said 
each attribute is calculated as the characteristic value. 

24. (Original) The system of claim 21, wherein the deviation is calculated as a difference 
between said each attribute value and the characteristic value of said each attribute. 

25. (Original) The system of claim 21, wherein the deviation is determined by calculating a 
difference between said each attribute value and the characteristic value of the corresponding 
attribute, and by dividing the difference by the characteristic value of said each attribute. 

26. (Original) The system of claim 21, wherein the set of attributes is sorted using absolute 
values of the deviations of the attribute values as a sorting criterion. 
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